A CUDA SIMT Interpreter for Genetic Programming

نویسنده

  • W. B. Langdon
چکیده

A Single Instruction Multiple Thread CUDA interpreter provides SIMD like parallel evaluation of the whole GP population of 1 4 million RPN expressions on graphics cards and nVidia Tesla T10P. Using sub-machine code GP a sustain peak performance of 212 billion GP operations per second (3300 speed up) and an average of 4.5 peta GP ops per day is reported for a single card on a Boolean induction benchmark never attempted before, let alone solved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Soft Computing Preprint Graphics Processing Units and Genetic Programming: An overview

A top end graphics card (GPU) plus a suitable SIMD interpreter, can deliver a several hundred fold speed up, yet cost less than the computer holding it. We give highlights of AI and computational intelligence applications in the new field of general purpose computing on graphics hardware (GPGPU). In particular we survey genetic programming (GP) use with GPU. We give several applications from Bi...

متن کامل

Graphics processing units and genetic programming: an overview

A top end graphics card (GPU) plus a suitable SIMD interpreter, can deliver a several hundred fold speed up, yet cost less than the computer holding it. We give highlights of AI and computational intelligence applications in the new field of general purpose computing on graphics hardware (GPGPU). In particular we survey genetic programming (GP) use with GPU. We give several applications from Bi...

متن کامل

A Many Threaded CUDA Interpreter for Genetic Programming

A Single Instruction Multiple Thread CUDA interpreter provides SIMD like parallel evaluation of the whole GP population of 1 4 million reverse polish notation (RPN) expressions on graphics cards and nVidia Tesla. Using sub-machine code tree GP a sustain peak performance of 665 billion GP operations per second (10,000 speed up) and an average of 22 peta GP ops per day is reported for a single GP...

متن کامل

High Performance Manycore Solvers for Reservoir Simulation

SUMMARY The forthcoming generation of many-core architectures compels a paradigm shift in algorithmic design to effectively unlock its full potential for maximum performance. In this paper, we discuss a novel approach for solving large sparse linear systems arising in realistic black oil and compositional flow simulations. A flexible variant of GMRES (FGMRES) is implemented using the CUDA progr...

متن کامل

A simulation optimization method for internal trucks sharing assignment among multiple container terminals

Owing that the internal trucks (ITs) are frequently used as transportation equipments between yards and quaysides, the transportation efficiency of ITs secures a crucial position in container terminal productivity. Hence, a container terminal cannot contain a big number of ITs. As such, it is an imperative to explore an appropriate IT assignment strategy. Specifically for those container termin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009